Permutation tests to estimate significances on Principal Components Analysis

نویسنده

  • Vasco M. N. C. S. Vieira
چکیده

Principal Component Analysis is the most widely used multivariate technique to summarize information in a data collection with many variables. However, for it to be valid and useful the meaningful information must be retained and the noisy information must be sorted out. To achieve it an index from the original data set is estimated, after which three classes of methodologies may be used: (i) the analytical solution to the distribution of the index under the assumption the data has a multivariate normal distribution, (ii) the numerical solution to the distribution of the index by means of permutation tests without any assumption about the data distribution and (iii) the bootstrap numerical solution to the percentiles of the index and the comparison to its assumed value for the null hypothesis without any assumption about the data distribution. New indices are proposed to be used with permutation tests and compared with previous ones from application to several data sets. Their advantages and draw-backs are discussed together with the adequacy of permutation tests and inadequacy of both bootstrap techniques and methods that rely on the assumption of multivariate normal distributions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Construct Validity of the Reading Section of the University of Tehran English Proficiency Test

University of Tehran administers a test known as The University of Tehran English Proficiency Test (the UTEPT) to PhD candidates on a yearly basis. By definition, the test can be considered a high-stakes one. The validity of high stakes tests needs to be known (Roever, 2001). As Mesick (1988) maintains, if the validity of high stakes tests are not known, it might have some undesirable consequen...

متن کامل

Predicting the Young\'s Modulus and Uniaxial Compressive Strength of a typical limestone using the Principal Component Regression and Particle Swarm Optimization

In geotechnical engineering, rock mechanics and engineering geology, depending on the project design, uniaxial strength and static Youngchr('39')s modulus of rocks are of vital importance. The direct determination of the aforementioned parameters in the laboratory, however, requires intact and high-quality cores and preparation of their specimens have some limitations. Moreover, performing thes...

متن کامل

Choosing the Best Hierarchical Clustering Technique Based on Principal Components Analysis for Suspended Sediment Load Estimation

1- INTRODUCTION The assessment of watershed sediment load is necessary for controling soil erosion and reducing the potential of sediment production. Different estimates of sediment amounts along with the lack of long-term measurements limits the accessibility to reliable data series of erosion rate and sediment yield. Therefore, the observed data of suspended sediment load could be used to ...

متن کامل

Remote sensing of burned areas via PCA, Part 2: SVD-based PCA using MODIS and Landsat data

Background: Singular value decomposition (SVD), as an alternative solution to principal components analysis (PCA), may enhance the spectral profile of burned areas in satellite image composites. Methods: In this regard, we combine the pre-processing options of centering, non-centering, scaling, and non-scaling the input multi-spectral data, prior to the matrix decomposition, and treat their com...

متن کامل

Correction Scheme for Multiple Correlated Statistical Tests in Local Shape Analysis

In neuroimaging research shape analysis has become a field of great interest due to the ability to locate morphological brain changes between different groups. Currently, most local shape analysis approaches fail to correct for their high number of correlated statistical tests. This results in an overly optimistic estimate of the local shape analysis. This paper presents a correction scheme for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012